On Deterministic Sketching and Streaming for Sparse Recovery and Norm Estimation

نویسندگان

  • Jelani Nelson
  • Huy L. Nguyen
  • David P. Woodruff
چکیده

We study classic streaming and sparse recovery problems using deterministic linear sketches, including `1/`1 and `∞/`1 sparse recovery problems, norm estimation, and approximate inner product. We focus on devising a fixed matrix A ∈ Rm×n and a deterministic recovery/estimation procedure which work for all possible input vectors simultaneously. We contribute several improved bounds for these problems. – A proof that `∞/`1 sparse recovery and inner product estimation are equivalent, and that incoherent matrices can be used to solve both problems. Our upper bound for the number of measurements is m = O(ε−2 min{logn, (logn/ log(1/ε))}). We can also obtain fast sketching and recovery algorithms by making use of the Fast Johnson-Lindenstrauss transform. Both our running times and number of measurements improve upon previous work. We can also obtain better error guarantees than previous work in terms of a smaller tail of the input vector. – A new lower bound for the number of linear measurements required to solve `1/`1 sparse recovery. We show Ω(k/ε 2 +k log(n/k)/ε) measurements are required to recover an x′ with ‖x − x‖1 ≤ (1 + ε)‖xtail(k)‖1, where xtail(k) is x projected onto all but its largest k coordinates in magnitude. – A tight bound of m = Θ(ε−2 log(εn)) on the number of measurements required to solve deterministic norm estimation, i.e., to recover ‖x‖2 ± ε‖x‖1. For all the problems we study, tight bounds are already known for the randomized complexity from previous work, except in the case of `1/`1 sparse recovery, where a nearly tight bound is known. Our work thus aims to study the deterministic complexities of these problems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nearly-optimal bounds for sparse recovery in generic norms, with applications to k-median sketching

We initiate the study of trade-offs between sparsity and the number of measurements in sparse recovery schemes for generic norms. Specifically, for a norm ‖ ·‖, sparsity parameter k, approximation factor K > 0, and probability of failure P > 0, we ask: what is the minimal value of m so that there is a distribution over m × n matrices A with the property that for any x, given Ax, we can recover ...

متن کامل

Large-scale Inversion of Magnetic Data Using Golub-Kahan Bidiagonalization with Truncated Generalized Cross Validation for Regularization Parameter Estimation

In this paper a fast method for large-scale sparse inversion of magnetic data is considered. The L1-norm stabilizer is used to generate models with sharp and distinct interfaces. To deal with the non-linearity introduced by the L1-norm, a model-space iteratively reweighted least squares algorithm is used. The original model matrix is factorized using the Golub-Kahan bidiagonalization that proje...

متن کامل

Sparse Recovery of Streaming Signals Using ℓ1-Homotopy

Most of the existing methods for sparse signal recovery assume a static system: the unknown signal is a finite-length vector for which a fixed set of linear measurements and a sparse representation basis are available and an `1-norm minimization program is solved for the reconstruction. However, the same representation and reconstruction framework is not readily applicable in a streaming system...

متن کامل

Sparse Recovery of Streaming Signals Using L1-Homotopy

Most of the existing methods for sparse signal recovery assume a static system: the unknown signal is a finite-length vector for which a fixed set of linear measurements and a sparse representation basis are available and an `1-norm minimization program is solved for the reconstruction. However, the same representation and reconstruction framework is not readily applicable in a streaming system...

متن کامل

Sketching and Streaming High-Dimensional Vectors

A sketch of a dataset is a small-space data structure supporting some prespecified set of queries (and possibly updates) while consuming space substantially sublinear in the space required to actually store all the data. Furthermore, it is often desirable, or required by the application, that the sketch itself be computable by a small-space algorithm given just one pass over the data, a so-call...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012